Faster Fourier Transforms via Automatic Program Specialization
نویسنده
چکیده
Because of its wide applicability many e cient implementations of the Fast Fourier Transform have been developed We propose that an e cient implemen tation can be produced automatically and reliably by partial evaluation Partial evaluation of an unoptimized implementation produces a speedup of over times The automatically generated result of partial evaluation has performance com parable to or exceeding that produced by a variety of hand optimizations We analyze the bene ts of partial evaluation at both compile time and run time focusing on compiler issues that a ect the performance of the specialized pro gram
منابع مشابه
A Polymorphic Radix- n Framework for Fast Fourier Transforms
We provide a polymorphic framework for radix-n Fast Fourier Transforms (FFTs) where all known kinds of monomoporhic radix-n algorithms can be obtained by specialization. The framework is mathematically based on the Cooley-Tukey mapping, and implemented as a C++ template meta-program. Avoiding run-time overhead, all specializations are performed statically.
متن کاملAccelerating Object-Oriented Simulation via Automatic Program Specialization
Object-oriented simulations in an object-oriented environment are easier to construct and maintain than conventionally programmed simulations. Unfortunately, they are also slower because of message passing and other runtime overhead. We have developed an automatic program transformer that solves the eeciency problem for a large class of simulation programs. It automatically constructs an eecien...
متن کاملFast, Optimized Sun RPC Using Automatic Program Specialization
Fast remote procedure call (RPC) is a major concern for distributed systems. Many studies aimed at efficient RPC consist of either new implementations of the RPC paradigm or manual optimization of critical sections of the code. This paper presents an experiment that achieves automatic optimization of an existing, commercial RPC implementation, namely the Sun RPC. The optimized Sun RPC is obtain...
متن کاملFaster Run-time Specialized Code using Data Specialization
Run-time specialization is a technique that optimizes a program based on run-time information. In this context, specialization time must be constrained, limiting the possibility to further optimize the specialized code. We present a low-cost methodology for improving the code generated by a run-time specializer. This result is acheived by combining run-time specialization with another form of a...
متن کاملAutomatic Performance Tuning for Fast Fourier Transforms
In this paper we discuss architecture-specific performance tuning for fast Fourier transforms (FFTs) implemented in the UHFFT library. The UHFFT library is an adaptive and portable software library for FFTs developed by the authors. We present the optimization methods used at different levels, starting with the algorithm selection used for the library code generation and ending with the actual ...
متن کامل